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ABSTRACT 

Groups and clusters of galaxies hosting a quasar can be found through the 
detection of Lya absorption lines beyond the quasar redshift. The effect occurs 
whenever the distortion to the redshift distribution of Lya clouds induced by 
the cluster potential extends beyond the proximity effect of the quasar. Based 
on CDM cosmological models for the evolution of structure, we predict the 
probability for finding lines beyond the quasar redshift (^;abs > ^q) under the 
assumption that the physical properties of Lya clouds are not affected by fiows 
on large scales (^ Mpc) in the quasi-linear regime. If quasars randomly sample 
the underlying galaxy distribution, the expected number of lines with z^^s > zq 
per quasar can be as high as ~ 0.5 X [{dN j dz) j'ibQ] at z = 2^ where dN/dz is 
the number of Lya lines per unit redshift far from the quasar. The probability 
is enhanced if quasars typically reside in small groups of galaxies. A statistical 
excess of Lya lines is expected near very dim quasars or around metal absorption 
systems. Due to clustering, the standard approach to the proximity effect 
overestimates the ionizing background fiux at high redshifts by up to a factor 
of ~ 3. This result weakens the discrepancy between the deduced background 
fiux and the contribution from known populations of quasars. 
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Introduction 



Acceptable cosmological models for structure formation are designed to agree with large- 
scale-structure observations in the local universe. Despite the common calibration to fit the 
universe at present, these models can have very different predictions about its past history 
(e.g. Mo & Miralda-Escude f994). In this paper, we discuss a method to probe clustering 
of galaxies at high redshifts [z ~ 1-5) that could potentially test such models. The method 
makes use of the redshift distribution of Lya clouds in the vicinity of a quasar. 

If a quasar resides in a cluster or group of galaxies, its peculiar velocity may shift its 
apparent redshift below the redshift of its nearest Lya cloud. This effect is further enhanced 
by the gravitational potential of the massive host, which induces infall of distant Lya clouds 
and increases the number density of clouds out to scales of ~5 Mpc around the cluster. On 
the other hand, the quasar reduces the number of absorbers by ionizing clouds that are in 
physical proximity to it (Weymann, Carswell & Smith I98I; Carswell et al. 1982; Murdoch 
et al. 1986; Tytler 1987; Bajtlik, Duncan, & Ostriker 1988, hereafter BDO; Kovner & Rees 
1989; Lu, Wolfe, & Turnshek I99I, hereafter LWT; Bechtold 1994). Nevertheless, as we 
show later, the redshift distortion due to clusters can extend beyond the effective radius of 
the quasar proximity effect. 

The presence of an excess of C IV absorption systems near or beyond the emission line 
redshift of quasars has been discussed for two decades (see Foltz et al. 1987 for a review). The 
observed excess was attributed to absorption within clusters of galaxies or within the quasar 
host galaxies (Bahcall 1975; Weymann & Williams 1978). In the near future, spectroscopic 
data from the Keck telescope (Sargent 1994) and the Sloan Digital Sky Survey (Gunn & 
Knapp 1993) are expected to shed more light on the statistics of this phenomenon. In this 
work we use the current theory for structure formation in the universe in order to quantify 
the statistical effect that clusters should have on the redshift distribution of absorption 
systems (Sargent et al. 1980; Young et al. 1982). We prefer to focus on Lya rather than 
C IV absorption systems since Lya clouds are much more common and therefore sample 
better the cosmological mass distribution around clusters. In addition, the proximity effect 
of the quasar eliminates the contribution of clouds that physically reside inside the host 
cluster. This simplifies the theoretical analysis since it leaves out the unknown properties of 
absorbers inside virialized systems such as galaxies or clusters. The clustering signature is 
then dominated by the peculiar velocity shear on the scale of a few Mpc where the matter 
has not virialized yet. We consider the statistics of lines with arbitrary velocity shifts relative 
to the quasar out to — 2000km/s. 

The outline of this paper is as follows. In §2 we calculate the probability for finding 
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Lya lines above the quasar redshift. The calculation is done under the assumptions that 
quasars are drawn at random from the underlying distribution of either galaxies (~lO^^Af0) 
or small groups of galaxies (~lO^"^Af0), and that the physical properties of Lya clouds are not 
altered by cosmological flows on large scales (^ Mpc) in the linear and quasi-linear regimes. 
The clustering signature obviously depends on the assumed minimum mass of the quasar 
host and on the threshold for cloud destruction; we therefore present results for different 
sets of assumptions. The current distribution of cluster properties (Bahcall & Cen f993; 
Zabludoff et al. f 993) is evolved back in time using the Press-Schechter formalism (Press & 
Schechter f974) and the spherical collapse model (Gunn & Gott f972); a procedure known to 
be in good agreement with N-body simulations (Lacey & Cole f993 and references therein). 
Since the mass scales of interest are covered by existing galaxy surveys, we adopt a power 
spectrum of initial density perturbations that gives a good phenomenological fit to local 
cluster data (cf. Bahcall & Cen f993). Numerical results from this approach are presented 
in §3. Finally, §4 summarizes our main conclusions. 

2. Method 



Previous analyses of the redshift distribution of Lya clouds in the vicinity of a quasar 
(e.g. EDO, LWT, Bechtold f994) have assumed that the velocity difference between the 
absorbing Lya cloud and the quasar is a measure of the actual physical distance between 
the two. In this paper we include two sources of peculiar velocity that alter this assumption. 
First, we allow the quasar to reside in a virialized group of galaxies and to have a Gaussian 
probability distribution of peculiar velocities according to the velocity dispersion of this host. 
Second, we allow for an infall peculiar velocity and for a radial displacement of the Lya clouds 
far outside the host due to its gravitational pull. This infall also increases the number density 
of Lya clouds near the quasar. We treat the distribution of clustering properties at high 
redshifts according to the Press-Schechter formalism and approximate the infall dynamics 
by the spherical collapse model. 

We begin with a brief review of the spherical collapse solutions, as they play an important 
role in our analysis. In a bound system, the solution r[t) for the radius of a spherical shell 
with a mass M interior to it may be written as, 

r = A(I — cos rj) 
t = B{r] — sin?y), 

where A^ = GMB^ (Gunn & Gott 1972; Peebles 1980). The remaining degree of freedom in 
the choice of A and B may be set by picking the mean overdensity interior to the shell 
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S{R) = J^^j^d^x 6[x.) at a particular redshift. Here S[x.) = [6p[x.) — pb]/pb^ pj, is the 
background density, and R is the radius of the sheU. If the overdensity is Si at a high 
redshift Zi [rji ^1), then 

where Hq and Oq are the Hubble constant and density parameter of the background universe 
today. Note that this solution has the peculiar velocities of the growing mode. For an 
unbound system, we alter equation (1) to the usual hyberbolic solution and change equation 

(2) to 

Using the growing mode growth factor, one can calculate the r.m.s. fluctuations crM{zi) on 
a particular mass scale M at redshift Zi. This yields crM{zi){l + Zi) = (7^(0) for an Oq = 1 
universe and aMizi){l-\-Zi) = (Tm(O)-Di(Oq — I)/[2(0q — 1)/5] for an Oq < 1 universe, where 
Di{x) = I + 3/x + (3\/I + x/x"^'"^) ln(vl+~c — a/Jc) is the growth factor in an open universe 
(Groth & Peebles 1975). In this manner, one can connect the Gaussian initial conditions to 
the properties of the spherical solution. 

Given a particular background cosmology and a power spectrum of Gaussian initial 
perturbations, the Press-Schechter formalism and the spherical collapse model yield the 
distribution of object masses as a function of time. We assume that an object has virialized 
when its outermost shell reaches half its turn-around radius (i.e. rj = 37r/2). For a given 
cosmology, there is one value of 6i[l + Zi) that results in a collapse at the epoch of the 
quasar's emission tq. The ratio of this value and crM{zi){l + Zi) yields the critical peak 
height threshold Pcitq^M) needed to get a collapse on a given mass scale by the time tq. 
The Press-Schechter model then predicts the number density of collapsed objects n at time 

dn 2 ( dXnVr 
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We set the velocity dispersion of a virialized object as follows. The specific binding 
energy of the system at turn- around is roughly — GAf/rta, where M is the mass of the system. 
The turn-around radius rta is found by setting the collapse time of the object [r] = 37r/2) 
to tq. Assuming that the system relaxes into an isothermal sphere with a one-dimensional 
velocity dispersion cr, the specific kinetic energy of the final virialized object is |(T^. The 
virial theorem implies that this kinetic energy is equal to the absolute value of the total 
energy, yielding 
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One should note that estimates of this type appear with a variety of proportionahty constants 
in the hterature (Narayan & White 1987; White et aL 1993), but the exact number depends 
on the true density profile and collapse history of the object. 

For each object, we assume that the probability of hosting a quasar is proportional to 
its mass, but in addition we assume that there is a minimum mass for a quasar host, chosen 
to be either 10^^ M© or 10^"^ M© (all masses refer to the epoch of the quasar's emission). 
The former choice corresponds to galactic masses, which appear necessary to power the 
quasar (Turner 1991), while the latter choice corresponds to masses of small galaxy groups, 
which are suggested by the clustering properties of quasars (Bahcall & Chokshi 1991). For 
a given quasar, the probability distribution of host masses is proportional to Mdn/dM. By 
determining the host mass, we also find the velocity dispersion of the host, allowing us to 
randomly select the line-of-sight peculiar velocity of the quasar from a Gaussian distribution. 
This procedure is clearly appropriate for groups of galaxies, but it is inappropriate if the 
host is just a single galaxy with the massive quasar black hole at its center. We ignore the 
latter case because 10^^ M© host systems could still be composed of unmerged components 
at high redshifts and because the velocity dispersions of single galaxies are quite low and 
make a small contribution to the clustering signature. 

Let us now turn to the peculiar velocities of the Lya clouds. Far away from the quasar, 
the clouds are uniformly distributed with no significant correlations (Sargent et al. 1980, 
Webb & Barcons 1991). We therefore assume that prior to the application of the quasar's 
ionizing fiux, the clouds trace the mass distribution around the cluster. This means that 
before gravitational interactions cause the region around the host to contract, the clouds 
are uniformly distributed. Previous studies of quasar spectra have found that far from the 
quasar the number of Lya lines per unit redshift is well described by a power-law, 

dN/dz = Ao(l + zy, (6) 

with 7 = 2.4±0.3 and Aq ^ 3 for a rest equivalent width W > 0.36A (BDO, LWT), although 
more recently Bechtold (1994) found a shallower redshift dependence with 7 = 1.9 it 0.3 for 
W > O.32A and 7 = 1.3 ± 0.2 for W > 0.16A. The values of Aq and 7 depend on the 
sensitivity and the spectral resolution of the observations. In this work, we use dN/dz 
values of 350 at z = 2 and 700 at z = A as rough estimates for the expected density of lines 
in forthcoming observations with the Keck telescope (Sargent 1994). According to equation 
(6), these values correspond roughly to a rest equivalent-width threshold of ~0.02A or a 
column density threshold of ~4 X 10^^ cm~^. Our results are linearly proportional to these 
choices for dN/dz. We then find the number of lines per unit distance using the relation 
dN/dr = [Hq/c)[1 -\- ZQ){dN/dz), where Hq is the Hubble constant at the redshift of the 
quasar and c is the speed of light. 
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To evaluate the gravitational effect of the host cluster on the Lya cloud distribution, 
we consider the quasar host and its environment at very early times and fix the origin of 
the coordinate system at the center of the host. The host material comes from a sphere 
centered at the origin with a mass equal to the final host mass. We assume that no 
Lya clouds ultimately survive inside the host because of its hot virialized gas and the intense 
ionizing radiation of the quasar. Thus, the material that eventually makes up the Lya clouds 
originates outside this sphere. Starting from the outer radius of the host sphere rhost, the 
clouds are assumed to be uniformly distributed at their initial positions and to behave as 
test particles. We then evolve a particular cloud position forward in time using the spherical 
solution. In doing so, we need only specify the mean overdensity interior to the radius of 
the cloud. The spherical solution yields the position and velocity at the final time (i.e. the 
time of observation). The final cloud velocity, which includes both the Hubble fiow and 
the peculiar velocity, is added to the quasar's peculiar motion within the host in order to 
determine the net difference between the quasar redshift and the cloud absorption redshift. 
At the same time, the final position of the cloud is used to determine its fate with respect 
to the ionizing UV fiux from the quasar. 

The mean overdensity interior to the radius of the cloud rdoud may be found from the 
properties of Gaussian random fields (Bardeen et al. 1986; Eisenstein & Loeb 1995). The 
overdensity interior to the sphere of radius rhost = ^i is exactly the value of 6 that produces 
a collapse at the time tq. Given this constraint, we wish to find the overdensity interior to 
some other radius rdoud = ^2- This overdensity has a Gaussian distribution with a mean 
and a variance that can be determined analytically. An important simplifying aspect of the 
problem is that we seek only to calculate the ensemble- averaged probability for observing 
a line at a particular redshift. This allows us to consider each radius separately instead 
of constructing a full realization of the spherical density profile around the host. We are 
free to consider the overdensity interior to rdoud as constrained only by the existence of the 
host and independent of the results for other clouds. In doing so, we neglect the possibility 
that the mean overdensity might fail to be a monotonically decreasing function of radius, 
a situation that would lead to shell crossing. The effect of the latter complication on our 
results is expected to be rather small for the power spectra considered here. If one wished to 
study the correlations between cloud positions, then one would need to compute realizations 
of actual density profiles and compile a catalogue of absorption line spectra. 

Given a power spectrum of initial density perturbations P{k), the r.m.s. mass fiuctuation 
6M/M in a sphere of radius r is 
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where ji is the spherical Bessel function. We let 6i and 62 be the overdensities interior to 
radii ri and r2, respectively, and then define Pj = 8^ j (TM{rj), where j = 1,2. Given a value 
for z/i, the distribution of V2 has a mean of 7i2'^i and a variance of 1 — 7^2 (Bardeen et al. 
1986; Appendix A of Eisenstein & Loeb 1995), where 
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If the power spectrum is a power law P{k) oc A;", then one can simplify these results 
so that only the ratio of the mass scales enters the calculation. Manipulation of the above 
integrals yields the following distribution for 62^ 



62 = Sip 
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where p = r2/ri, and (^ is a random number that is Gaussian distributed with zero mean 
and unit variance. The function 



gn{p) = Mp 
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is normalized by J\f to have (/^(l) = 1. In this work, we use the complete (i.e. not power-law) 
power spectra in the Press-Schechter formalism because such detail is important for getting 
the correct relative weights between single galaxies and massive groups of galaxies. However, 
for the purpose of computing the density profiles around hosts we find it sufficient to use 
a simple power-law approximation to the power spectrum, which best describes the slope 
near the characteristic mass-scale. In particular, we use a spectral index of n = —1.5, which 
matches the index in cold dark matter cosmologies for M sa 10^"* M©. 

In addition to inducing peculiar velocities, the gravitational pull of the quasar host 
increases the density of material and therefore the number density of Lya clouds in the 
neighborhood of the host out to scales of order 5Mpc. The statistical properties of the 
clouds should not be disturbed in the quasi-linear regime [6 ^ 1), as the change in their 
large-scale environment relative to the background universe is not significant. Under these 
conditions, the clouds drift smoothly together with their surrounding environment. However, 
as nonlinearities develop on Mpc scales the clouds may be destroyed or at least change 
their physical properties (e.g. neutral fraction, column density, size) due to changes in the 
pressure of the surrounding gas. In our analysis, we use the turn-around time [r] = tt) as the 
transition point between the quasi-linear and nonlinear stages [Sta. ~ 4.6) and then assume 
conservatively that all clouds are destroyed after turn-around due to nonlinear hydrodynamic 
or thermal processes. The significance of this assumption is partially compensated by the 
fact that the quasar ionizes most of the Lya clouds out to a distance of a few Mpc around the 
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host cluster anyway. Thus, most of the absorbing clouds participate in quasi-linear regime 
flows on large scales (^ Mpc), well outside the virialization shock of the host cluster or 
the boundary of its hot gas. It therefore seems plausible to use the above assumption as a 
working hypothesis. We prefer to confine our discussion to this assumption, since any of its 
potential modifications would add model-dependent parameters of uncertain hydrodynamics 
and thermodynamics to our analysis. We thus postulate that up until turn-around the 
line of sight to the quasar intersects more clouds as the material around the cluster drifts 
inwards relative to its Hubble fiow. Using the spherical collapse model, the corresponding 
enhancement factor in the number of lines per unit radius is simply (rdoud/?"//)^, where r^ 
is the position that a cloud would have had in a pure Hubble fiow. As a reference to the 
significance of our assumption, we will still show some results (cf. the uppermost curve 
in Figs. 1-5) for the situation where the clouds maintain their properties down to the 
virialization radius of their shell (ryir = hrta 



ta ; • 



The distribution of Lya clouds around the quasar is diluted by the proximity effect 
(Carswell et al. 1982), namely the photoionization of clouds by the strong UV fiux of the 
quasar (EDO; LWT; Kovner & Rees 1989; Bechtold 1994). The effect depends on the ratio 
between the UV fiux of the quasar at the cloud's position and the fiux of the diffuse UV 
background at the quasar's redshift. If the luminosity of the quasar per unit frequency at 
the Lyman limit is Zj,, then the fiux at the cloud's radius rdoud is F^ = Liy/Airrl^^^^. We 
define uj = F^ /AirJiy, where Jj, is the diffuse UV background fiux per unit frequency at the 
quasar redshift at the Lyman limit. At ionization equilibrium, the neutral hydrogen column- 
density of each cloud is diluted by a factor (1 -|- cu) relative to its value in the absence of the 
quasar fiux. Based on the power-law distribution of column densities for Lya clouds, the 
proximity effect then implies that the number of lines above some column-density threshold 
is suppressed by a factor (1 -|- c<;)~°'^ (EDO). To find the luminosity of the quasar, we use 
the observed fiux f^ at the appropriately shifted Lyman-limit frequency and multiply by 
47r/(l -|- zq) and the square of the luminosity distance in the relevant cosmology (Weinberg 
1972). Because we wish to consider quasars with different fiuxes as well as allow for the 
uncertainty in the UV background, we will do our calculations in terms of the variable 

/ /j, \ /10~^^ ergcm~^s~^ Hz~^ sr~^\ 

•' " (^ 10-27 erg cm-2s-iHz-ij V X J ' 

Previous studies of the proximity effect have used quasars with values of / in the range 1-10. 

Based on the proximity effect and the two sources of peculiar velocity, we calculate the 
mean number of Lya lines per unit velocity, dN/dv. The velocity difference v is measured 
relative to the emission redshift of the quasar, so that lines with negative velocities have 
-^abs > Zq and are only possible with a fortuitous coincidence of peculiar velocities. The 
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function dN/dv is obtained by integrating over the probability distributions for the quasar 
host masses, the quasar pecuhar velocities inside the host cluster, the possible initial radii for 
the Lya cloud, and the possible overdensities within the cloud radius. The results depend 
on the value of /, the background cosmology, and the assumed power spectrum of initial 
density perturbations. 

3. Results 

We study two cosmologies, each of which gives a rough fit to the observed large scale 
structure in the local universe. They both involve cold dark matter (CDM) with a power 
spectrum of the form 

P{k) oc ^ (12) 

{l + [ak + {bkf/^ + {ckyY} 

where a = {6A/T)h-^ Mpc, b = {3.0/T)h-^ Mpc, c = {1.7 /T)h-^ Mpc, and ly = 1.13 
(Efstathiou, Bond, & White 1992). For our fiat cosmology, we use = 1, F = 0.5, and 
CTg/j-i Mpc = 0.6. For our open cosmology, we use = 0.3, F = 0.2, CTg/i-i Mpc = 0.8, and a 
zero cosmological constant. The value of the Hubble constant cancels out of the calculation, 
except for the indirect dependence in F. All masses in this paper refer to the total mass, 
including dark matter. 

We present results for two different redshifts, z = 2 and z = A^ and for various values of 
/. We consider three different cases. The first is referred to as "Proximity Only", where we 
set all the peculiar velocities to zero and include only the proximity suppression (1 +c<;)~°'^, 
with Tcioud inferred from the Hubble fiow. This case has been used in previous analyses 
of the proximity effect. It describes a situation where quasars preferentially reside in low 
mass systems with negligible peculiar velocities. The second case is referred to as "Infall, 
M > 10^^ M©" and includes all the sources of peculiar velocity that were discussed in §2. 
In this case, we assume that the minimum mass of the quasar host is 10^^ M©. The third 
case is referred to as "Infall, M > 10^"^ M©" and is the same as the second case except 
that the minimum mass of the host is 10^"^ M©. This means that quasars are not allowed to 
exist in field galaxies and instead reside in systems at least as massive as groups of galaxies. 
This condition increases the velocity dispersion that the quasar can have within the host 
system and extends the gravitational effect of the host on the Fya clouds. The third case 
is suggested by the fact that the correlation function of quasars is closer to that of groups 
than to that of field galaxies (Bahcall & Chokshi 1991). 

In Figures 1-4, we present plots of the expected number of lines per unit velocity dN/dv. 
Positive velocities mean z^^s < zq. The density of lines dN/dz far away from the quasar 



-10- 



was assumed to be 350 for z = 7. and 700 for z = A (see discussion following Eq. 6); all 
predictions in these figures and in the tables are linearly proportional to these values. Each 
figure represents a different combination of redshift and cosmology. Within the figures, each 
set of three curves shows the results for three values of /. The case / = means that the 
proximity effect has been turned off; the "Proximity Only" case then represents the base 
number of lines that would be expected far from the quasar. As / gets larger, the quasar is 
brighter and there are fewer lines near the quasar due to the increase in its ionizing fiux. In 
addition to the above cases, we also show with the dot-dash line the extreme case of infall 
with M > 10^^ M© and / = in which we allow the clouds to proceed past turn-around all 
the way to the virial radius of their shell [r] = 37r/2). Note that while this case produces a 
marked increase for v < 1000 km/s, it makes no change relative to the "Infall, M > 10^^ M© " 
case for v > 1000 km/s. 

In light of recent spectroscopic observations with the Hubble Space Telescope (Bahcall 
et al. 1993), it is interesting to consider the clustering signature at low redshifts. The shapes 
of the dN/dv curves at z < 1 are similar to those presented in Eigures 1 and 2, but because 
of the scarcity of Lya lines at low redshifts, the normalization of the vertical axis is very 
different. Not only does the abundance of clouds fall steeply at low redshift, but also the 
equivalent width threshold of HST is much greater than that of the Keck telescope. Eor HST 
observations at 2; sa 1, we estimate dN/dz < 20, which means that the number of lines with 

V < km/s is undetectably small. In addition, the background fiux J^ estimated by Kulkarni 
& Eall (1994) at 2; < 1 is smaller by 1-2 orders of magnitude than the estimated background 
at 2 < z < 4:. This means that for a given intrinsic quasar luminosity, the proximity effect 
extends out to a much larger radius of infiuence at z < 1^ further concealing the effects 
of clustering. Put another way, Eigures 1-4 show that the number of lines with negative 
velocity shifts is significantly smaller than the deficit caused by the proximity effect. Since 
the detection of the proximity effect at low redshift (Kulkarni & Eall 1994) is only marginal, 
the predicted clustering signature is most likely undetectable with HST for z < 1. 

In general, the inclusion of infall and peculiar velocities increases the number of lines 
relative to the proximity effect. This is primarily caused by the increase in the number 
density of clouds around the host. Eor bright quasars, the higher ionizing fiux extends the 
proximity effect to a large radius. The surviving Lya clouds only feel a weak gravitational 
pull; moreover, the peculiar velocity of the quasar within its host is only a small fraction 
of the Hubble velocity of the clouds. Thus, the distribution of the clouds is only weakly 
perturbed from that of the proximity effect alone. Dim quasars, however, cannot ionize the 
clouds to a sufficiently large radius so as to hide the effects of infall and peculiar velocities. 
These quasars develop a tail of negative velocity lines as well as a significant excess of lines for 

V ^ 2000 km/s. Note that in the limit of / — )• 0, the number of Lya lines per unit redshift 
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ls not conserved because the infall allows a gathering of clouds from the two dimensions 
transverse to the line of sight; this Is In difference from the effect of the quasar motion Inside 
the host, which merely shifts the velocity distribution of lines without changing their total 
number. As revealed by the excess bump In the number of Lya lines shown In Figures 1-4 
for / = 0, It Is even likely to find an "antl-proxlmlty" effect near very dim quasars. 

We expect that a similar statistical excess of Lya lines will occur near galaxies along 
the line of sight, such as the ones associated with metal absorption systems (Barcons & 
Webb 1990). In these cases, the proximity effect of the galaxies should be much weaker 
than that of quasars and the extent of the excess bump should be closely described by the 
/ = curves. We Investigate this by assuming that the metal-line systems are metal-rich 
clouds In the halos of galaxies (Steldel & Sargent 1992). We neglect the Impact parameter 
between the line of sight and the center of the host and then allow the metal-line system 
Itself to move with the velocity dispersion of the host halo. The expected dN/dv distribution 
of Lya lines relative to the host Is then found by taking the dN/dv curve for the quasar case 
(Fig. 1-4) and adding It to Its refiectlon around v = 0. The result Is shown In Figure 5. 
Here we consider both M > W^ M® and M > W^ M® for = 0.3, as well as M > 10^^ ^^ 
for = 1. We also show the result for M > 10^^ M© and = 0.3 If one allows the clouds 
to evolve beyond turn-around and fall back to the vlrlal radius of their shell. In all cases, 
one finds an appreciable Increase In the numbers of Lya clouds around metal-line systems. 
Integrating the difference between the curves In Figure 5 and the base expectations (dotted 
line) yields an excess of 1.5 Lya lines per system In the M > 10^^ M© case, 2.1 lines In 
the M > W^Mq, n = 0.3 case, 0.9 lines In the M > 10^^^^^ q = i case, and 8.6 lines 
In the "Beyond Turn-Around" case. As before, these numbers are linearly proportional to 
[{dN/dz)/350]. 

This effect has been searched for using the cross-correlation of metal-line systems with 
Lya lines (Barcons & Webb 1990), but no significant signal was found. The number of excess 
lines predicted by our model for this study may be estimated as follows. The study Included 
51 metal-line systems, many around z = 2 but a few at higher redshlft. Adopting z = 2^ the 
equivalent width threshold of 0.36A yields dN/dz = 40 (BDO). If 1.5 x [{dN/dz)/350] excess 
lines are expected around each metal-line system, then around 9 extra lines would be present 
In the sample, presumably a detectable number. However, the Lya line that Is associated 
with the metal-line system Itself (which, of course. Is not Included In the cross-correlation) 
tends to be broad and therefore obscures other lines within several hundred km/s of the 
associated line (Barcons & Webb 1990). As can be seen from Figure 5, removing the range 
—300 km/s < V < 300 km/s cuts the signal by about a factor of two. If one uses = 1 rather 
than = 0.3, then the signal Is further reduced by another factor of two, yielding an excess 
of 2-3 lines, which Is marginally compatible with the null result of the cross-correlation study. 
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The more extreme "Beyond Turn-Around" model is ruled out by these observations. Studies 
with higher resolution and lower equivalent width threshold should strongly constrain these 
models; continued null results would require lower masses for the metal-line systems, or the 
destruction of the Lya clouds at a lower density threshold. 

The above results for dN/dv are ensemble averages in which we have integrated over 
several unknown parameters, such as the mass of the host. This means that for a particular 
quasar, the positions of the absorption lines will be correlated. For example, if the quasar 
resides in a massive cluster this would favor having multiple low velocity lines, but if the 
quasar resides in a field galaxy, then the chance of having low velocity lines is small. 
Therefore, if one line is seen with u < 0, the chance of seeing another low velocity line 
is increased. 

Tables 1 and 2 present the expected number of lines per quasar with negative velocites. 
Table 1 is for z = 2^ and Table 2 is for z = A. For each statistic, we show results in the 
two cosmologies. The = 1 cosmology, with its more recent structure formation, produces 
a slightly smaller tail at u < 0. Because we remove any clouds that evolve past turn-around 
relative to the quasar host, there would be no negative velocity lines were it not for the 
peculiar velocity of the quasar within its host. Relaxing the condition that no absorption 
clouds survive beyond turn-around and instead allowing them to survive down to the virial 
radius of the shell produces a stronger negative velocity tail. For bright quasars, the number 
of lines with v < are doubled and the number with v < — 500km/s are tripled; for dimmer 
quasars, the increase is even larger. The increase is not only caused by the fact that the 
clouds beyond turn-around have a higher redshift than the quasar host but also by the large 
density enhancements (~180 above background) that are reached as the shell containing the 
clouds approaches virialization. 

Table 3 presents the total number of lines expected between v = and v = 3000 km/s 
for z = 2 and a variety of / values. The numbers listed are the difference between the 
calculated number and the base number that would be expected within the same velocity 
range far away from the quasar. Table 4 presents the results for z = A. The above velocity 
range was picked in order to roughly maximize the signal of the deficit of lines compared to 
the noise of the base level. In principle, the signal-to-noise ratio can be further improved by 
picking different ranges for different values of / and z. 

Tables 1-4 show minor differences between the two CDM cosmologies. The results for 
= 1 show less infall and smaller peculiar velocities, since in an = 1 universe structure 
forms later than in an open universe (Richstone, Loeb, & Turner 1992). We also tested 
the sensitivity of the results to changes in the assumed spectral index n = —1.5 around the 
host mass-scale by trying values of n = —2 and n = —1. As expected, n = —2 yields a 
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stronger infall signal, while n = —1 results in a weaker one. For the production of lines with 
u < 0, the differences are less than 20%. For the number of lines between and 3000 km/s, 
the n = —2 spectrum produces a significantly larger excess because it makes the density 
fiuctuations much larger at radii of order 5Mpc. 

Detection of a deficit of lines below the base expectations has been taken as evidence 
for the proximity effect and then used to determine the UV background fiux (BDO, LWT, 
Bechtold 1994). The addition of peculiar velocities changes the predicted deficit of lines, 
although for large values of / the difference is small. Therefore, one should use bright 
quasars (with / ^ 1) to determine the UV background fiux. For dim quasars (/ ^ 1), 
neglecting peculiar velocities would lead to a significant overestimate of the UV background. 
To get the size of this effect, one should use the number in Table 3 or 4 for a particular 
value of / in one of the "Infall" cases and then find the value of / in the "Proximity Only" 
case that gives the same deficit. The ratio of the two / values estimates the error in the 
inferred UV background. Applying this procedure to Table 3 suggests that the standard 
approach to the proximity effect overestimates the background fiux by up to a factor of ~ 3. 
For example, a deficit of about 1.6 lines would indicate / = 1 with infall but only / sa 0.4 
without infall, hence the UV background fiux would have been overestimated by a factor 
of 2.5 if infall was ignored. In making this estimate, we have only considered lines with 
V > because candidate Lya lines with v ^ have typically been marked as unidentified 
and omitted. In addition, the uncertainties in the quasar redshift are another source of 
error in the inferred UV background (Fspey 1993; Bechtold 1994). The effect of infall would 
be particularly pronounced in samples of radio-loud quasars, which are known to reside 
in rich cluster environments (Yee & Green 1987). This clustering signature has important 
implications as it tends to lower the inferred background fiux down to a level that could 
be provided by the known populations of quasars (Miralda-Fscude & Ostriker 1990; Madau 
1992; Fall & Pei 1993; Meiskin & Madau 1993). 

While more massive hosts are more effective at producing peculiar velocity shear, the 
small probability that the quasars actually reside in such hosts keeps these mass scales from 
dominating the results. For / = 3 at 2; = 2, the contribution to the number of lines with 
u < for hosts of W^Mq, W^Mq, lO^^M®, and W'^Mq per host site per logM are in 
the ratio 1.0 : 3.4 : 13.5 : 53.2. This indicates that for brighter quasars the high mass 
systems are much more efficient at producing negative velocity lines. However, when we 
weight these contributions by the actual numbers of such hosts for = 0.3, we get the ratio 
1.0 : 3.6 : 8.3 : 1.9. Systems with masses between 10^^ M© and 6 X 10^"^ M© provide most 
of the signal. Indeed, the fact that there is a difference between the 10^^ M© and 10^"^ M© 
minimum mass cases demonstrates that the massive ^ 10^"* M© clusters are not dominating 
the results. If one assumes that quasars reside in lower mass [M sa 10^^ M©) hosts, then the 
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small mass of these hosts will produce little deviation from the usual proximity effect. In a 
similar fashion, any inaccuracies in the Press-Schechter approximation that yield the wrong 
ratio between the number density of field galaxies and groups of galaxies will alter the results 
roughly linearly in the ratio between the number densities of these two mass scales. It is 
worth noting that for / = 0, even small mass scales are capable of producing an appreciable 
number of lines near u = 0, but these lines are the first to disappear when the proximity 
effect is turned on. 

A related concern is that at high redshift the number density of quasars may exceed 
the number density of objects with masses greater than 10^"^ M©. At z = 2^ we find the 
comoving number density of objects with masses greater than 10^"^ M© to be 9 X 10~^ h^ Mpc""^ 
in the open cosmology and 8 X 10~^/i"^ Mpc""^ in the fiat cosmology. At 2; = 4, we find 
8 X 10~^h^ Mpc""^ in the open cosmology and 1.5 X 10~'^h^ Mpc""^ in the fiat cosmology. Since 
the comoving number density of quasars at the relevant magnitude range is ~10~^/i"^ Mpc""^ 
(Hartwick & Schade 1990; Schmidt, Schneider, & Gunn 1991), these results suggest that 
quasars at 2; = 4 in a fiat cosmology must have a minimum host mass less than 10^"^ M©. 
Furthermore, to make the = 1 results in the tables match those of the = 0.3 cosmology 
at a given value of /, one has to increase the minimum host mass, thus pushing against 
the limited number density of massive hosts in the fiat universe. When we apply our model 
to mixed dark matter (MDM) cosmologies (Klypin et al. 1993; Klypin et al. 1994; Mo & 
Miralda-Escude 1994; Ma & Bertschinger 1994), we run out of hosts with masses greater 
than 10^^ M© as we approach z = A. One is therefore forced to consider lower mass hosts, 
which strongly reduces the infall effects relative to the proximity effect. Therefore, models 
like MDM that form structure late should be well described by the "Proximity Only" case 
at high z and will produce few, if any, negative velocity lines. 

4. Conclusions 

In this paper we have quantified the signature of early clustering on the redshift 
distribution of Lya lines around the quasar redshift. We have shown that the inclusion 
of peculiar velocities and density enhancements in the neighborhood of quasar hosts leads to 
an excess of lines for redshifts within 0.01(1 + zq) of the quasar redshift (i.e. v < 3000 km/s) 
and allows some lines to have redshifts higher than the quasar. The effect is primarily caused 
by the peculiar velocity shear and the corresponding overdensities on large scales (^ Mpc) 
and is dominated by the contributions of groups of galaxies rather than by rich clusters. In 
the linear regime of structure formation {Sp/ p ^1), the environment of the clouds does not 
change significantly relative to the smooth background. In our calculations, we assumed that 
the statistical properties of Lya clouds are not altered even in the quasi-linear regime as they 
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drift smoothly together with their large-scale environment, but that the clouds are entirely 
destroyed in the nonlinear infall phase. The transition point between the quasi-linear and 
nonlinear stages was primarily assumed to be turn- around relative to the quasar host. The 
actual fate of Lya clouds in the nonlinear infall stages depends on the physical mechanism 
responsible for the confinement of the clouds, which is at present uncertain (Cen et al. 1994, 
and references therein). 

The statistics of lines with negative velocity shifts relative to the quasar provides a 
measure of structure at high redshifts (1 ^ ^; ^ 5). If quasars typically reside in small 
groups of galaxies (~ lO^"^Af0), then the clustering signature should already show-up in 
samples of a few quasars observed with the Keck telescope (cf. Tables 1-4). The association 
of quasars with groups is suggested by their own clustering properties (Bahcall & Chokshi 
1991) and their cross-correlation with galaxies (Hartwick & Schade 1990; EUingson, Green, 
& Yee 1991). A hint for unidentified absorption lines, which could in principle be Lya with 
-^abs <^ zq, is already found in existing data sets (Bechtold 1994). Weaker effects could 
be detected with larger quasar samples, such as provided by the Sloan Digital Sky Survey 
(Gunn & Knapp 1993). The clustering signal should appear more frequently in samples of 
radio-loud quasars, since these objects are known to reside in richer cluster environments 
than radio-quiet quasars (Yee & Green 1987). Indeed, an excess of associated C IV absorbers 
was found in a sample of radio-loud quasars (Anderson et al. 1987) and was not seen for 
radio-quiet objects (Young et al. 1982). 

The main practical obstacle towards measuring the clustering signature is the 
observational uncertainty in quasar redshifts. The significance of the uncertainty is 
emphasized by the steep exponential decline in the number of lines at negative velocity 
shifts (cf. Figs. 1-4); small shifts in the velocity threshold can cause a significant change in 
the expected number of lines. A quasar redshift cannot be determined accurately from its 
Lya emission line, since the blue side of the line is distorted by the Lya forest. However, 
the C IV emission line can set the quasar redshift to within ib300km/s (Tytler & Fan 1992; 
Laor et al. 1994). Infrared observations with high spectral resolution of the [0 III] narrow 
emission line can further improve the redshift determination of the quasar to better than 
ib50km/s. Narrow lines are thought to originate from the host galaxy and therefore provide 
the relevant redshift for our analysis (Vrtilek & Carleton 1985). 

The identification and interpretation of Lya absorption lines beyond the quasar redshift 
could be complicated by various factors. First, such lines may be explained as metal 
absorption lines. However, based on the background frequency of metal absorption lines, the 
probability that a metal line from a different redshift will randomly appear within 1000 km/s 
of the Lya emission line of the quasar is very low. In addition, this potential confusion can 



-16- 



be resolved by tracing the Ly/3 counterparts ol the Lya lines. Second, a negative velocity 
line may be interpreted as being unrelated to cosmological peculiar velocities but rather as 
being caused by Lya absorption from a cloud inside the host galaxy. If an absorption cloud 
has a large relative velocity due to the fact that it resides within the broad line region of the 
quasar, then it is likely to have a metalicity higher than solar (Hamann & Ferland 1992) and 
its Lya redshift should show-up also in metal absorption lines. If associated metal lines are 
not found, the remaining possibility is that the cloud resides in a metal poor environment 
bound within the host galaxy, in which case the velocity shifts are limited to ^ 300 km s~^ 
and can be distinguished from cluster velocities. 

The Lya clouds have a characteristic response time of ~10'*yr/(l + u) to variations in 
the quasar flux. If quasars vary considerably on this timescale, then the observed redshift 
distribution of Lya lines may correspond to a different value of / than the apparent quasar 
fluxes (BDO). While this may complicate somewhat the conversion of dN/dv to constraints 
on cosmological models, it will still allow the clustering signature to appear in all cases 
where the proximity effect is only secondary to the redshift distortion caused by clustering 
(cf. Tables 3 and 4). 

Clustering at high redshifts has other implications aside from the existence of lines with 
negative velocity shifts. First, the redshift distribution of Lya lines should show small non- 
zero correlations along any random line of sight due to the existence of mass concentrations 
(Sargent et al. 1980; Webb & Barcons 1991). The detection of these correlations may 
prove to be more difficult than finding Lya lines beyond the quasar redshift since quasars 
preferentially reside in dense environments. However, near the redshift of very dim quasars 
or metal absorption lines, a significant statistical excess of ^ 1.5 X [[dN/dz)/350] Lya lines is 
expected at z = 2 (cf. Fig. 5). Fxisting samples (Barcons & Webb 1990) are already at the 
threshold of testing this signature, while higher resolution and more sensitive data available 
with the Keck telescope should easily reveal this excess. A second implication of clustering 
is that the standard approach to the proximity effect overestimates the ionizing background 
fiux at high redshifts, especially in samples of faint radio-loud quasars. For published samples 
(BDO, LWT, Bechtold 1994), the clustering effect could lower the inferred background fiux by 
up to a factor of ~3 (cf. Tables 3 and 4). This tends to weaken the discrepancy between the 
deduced background fiux and the contribution from known populations of quasars (see also 
Fspey 1993; Bechtold 1994). It is possible to reduce the clustering effect on the estimation of 
the proximity effect by considering intrinsically bright quasars; the values of the three cases 
in Tables 3 and 4 converge as / increases. 

Finally, we emphasize that the numbers in this paper were derived for specific (CDM) 
cosmological models and would be significantly lower in models with later structure formation 
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(e.g. MDM). The strength of a detected clustering signature can be used to infer a lower 
limit on the number density of groups of galaxies at high redshifts and to discriminate among 
these models. 

We thank J. Bahcall, A. Dobrzycki, A. Laor, E. Maoz, W. Sargent, C. Steidel, J. Webb, 
and especially J. Ostriker for useful discussions. D.J.E. was supported in part by a National 
Science Foundation Graduate Research Fellowship. 
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Case 


/ = o 


/ = 0.1 


/ = 0.3 


/ = 1 


/ = 3 


/ = 10 


V < -500km/s 
M > W^Mq 


n = 


0.3 


0.050 


0.044 


0.037 


0.026 


0.016 


0.008 


= 


1.0 


0.027 


0.022 


0.017 


0.011 


0.006 


0.003 


V < Okm/s 
M > W^Mq 


= 


0.3 


0.56 


0.45 


0.35 


0.23 


0.13 


0.064 


= 


1.0 


0.45 


0.35 


0.26 


0.16 


0.088 


0.041 


V < Okm/s 
M > W^Mq 


= 


0.3 


0.88 


0.79 


0.67 


0.48 


0.30 


0.15 


= 


1.0 


0.77 


0.69 


0.58 


0.40 


0.24 


0.12 



Table 1: The expected number of lines per quasar with a sufficiently negative velocity. The 
redshift of the quasar is 2. The numbers are proportional to [[dN/dz)/350]. 



Case 


/ = o 


/ = 0.1 


/ = 0.3 


/ = 1 


/ = 3 


/ = 10 


V < -500km/s 
M> 10^2 Mq 


= 0.3 


0.11 


0.040 


0.022 


0.010 


0.005 


0.002 


= 1.0 


0.077 


0.021 


0.011 


0.005 


0.002 


0.001 


V < Okm/s 
M> 10^2 Mq 


= 0.3 


1.71 


0.53 


0.28 


0.13 


0.062 


0.027 


= 1.0 


1.55 


0.44 


0.23 


0.10 


0.049 


0.021 


V < Okm/s 
M> 10^3 Mq 


= 0.3 


3.00 


1.48 


0.88 


0.43 


0.21 


0.091 


= 1.0 


2.85 


1.46 


0.87 


0.43 


0.21 


0.091 



Table 2: The expected number of lines per quasar with a sufficiently negative velocity. The 
redshift of the quasar is 4. The numbers are proportional to [(dN/dz)/ 700]. 
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Case 


/ = o 


/ = 0.1 


/ = 0.3 


/ = 1 


/ = 3 


/ = 10 


Proximity 
Only 


= 0.3 


0.0 


-0.83 


-1.39 


-2.40 


-3.79 


-5.76 


= 1.0 


0.0 


-0.86 


-1.45 


-2.49 


-3.92 


-5.91 


Infall 
M > 10^2 Mq 


= 0.3 


0.40 


-0.01 


-0.53 


-1.60 


-3.12 


-5.28 


= 1.0 


-0.02 


-0.43 


-0.96 


-2.01 


-3.51 


-5.60 


Infall 
M > 10^3 Mq 


= 0.3 


0.52 


0.23 


-0.21 


-1.21 


-2.73 


-4.96 


= 1.0 


-0.07 


-0.34 


-0.77 


-1.74 


-3.21 


-5.35 



Table 3: The change in the expected number of lines with < v < 3000 km/s from the 
number expected far from the quasar: 10.5. The quasar is at redshift 2. The numbers are 
proportional to [[dN/dz)/350]. 



Case 


/ = o 


/ = 0.1 


/ = 0.3 


/ = 1 


/ = 3 


/ = 10 


Proximity 
Only 


= 0.3 


0.0 


-9.4 


-14.6 


-21.4 


-27.1 


-31.2 


= 1.0 


0.0 


-9.4 


-14.5 


-21.3 


-26.5 


-31.2 


Infall 
M > W^Mq 


= 0.3 


0.58 


-7.4 


-13.0 


-20.3 


-26.3 


-30.8 


= 1.0 


-0.40 


-8.0 


-13.4 


-20.5 


-26.4 


-30.8 


Infall 
M > W^Mq 


= 0.3 


0.80 


-6.0 


-11.5 


-19.1 


-25.6 


-30.4 


= 1.0 


-0.71 


-6.8 


-12.1 


-19.4 


-25.7 


-30.4 



Table 4: The change in the expected number of lines with < v < 3000 km/s from the 
number expected far from the quasar: 35.0. The quasar is at redshift 4. The numbers are 
proportional to [(dN/dz)/ 700]. 
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FIGURE CAPTIONS 



Figure 1: The number density of Lya lines per unit velocity in units of (10"^km/s)~^ for 
the = 0.3 cosmology at z = 2. The results are shown for three values of /, each with 
curves for the two "Infall" cases and for the "Proximity Only" case. Negative velocities mean 
-^abs > zq. The dot-dashed curve shows the f = 0, M > 10^^ M© result for the case where 
the Lya clouds are not destroyed until they reach their virialization radius. The numbers 
on the vertical axis are proportional to [[dN/dz)/350]. 

Figure 2: The same as figure 1, but for = 1 and z = 2. The numbers on the vertical axis 
are proportional to [[dN/dz)/350]. 

Figure 3: The same as figure 1, but for = 0.3 and z = A. The numbers on the vertical 
axis are proportional to [(dN/dz)/ 700]. 

Figure 4: The same as figure 1, but for = 1 and z = A. The numbers on the vertical axis 
are proportional to [(dN/dz)/ 700]. 

Figure 5: The number density of Lya lines per unit velocity in units of (10"^ km/s)~^ around 
a metal-line system at z = 2. The solid and dot-dashed curves are for the "Infall" model with 
/ = and minimum masses of the host of the metal-line system of 10^^ M© for = 0.3 and 
= 1, respectively. The short-dashed curves is for the "Infall" model with a minimum host 
mass of 10^^ M© and = 0.3. The long-dashed curve assumes M > 10^^ M© and = 0.3 
and allows the clouds to survive past turn-around and until virialization. The dotted curve 
is the base number density of clouds in the absence of any clustering. The numbers on the 
vertical axis are proportional to [[dN/dz)/350]. 
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